A Bayesian Perspective on Hypothesis Testing
Author
Abstract
In a recent article, Killeen (2005a) proposed an alternative to traditional null-hypothesis significance testing (NHST). This alternative test is based on the statistic $p_{\mathrm{rep}}$, which is the probability of replicating an effect. We share Killeen's skepticism with respect to null-hypothesis testing, and we sympathize with the proposed conceptual shift toward issues such as replicability. One of the problems associated with NHST is that p values are prone to misinterpretation (cf. Nickerson, 2000, pp. 246–263). Another problem with NHST is that it can provide highly misleading evidence against the null hypothesis (Killeen, 2005a, p. 345): NHST can lead one to reject the null hypothesis when there is really not enough evidence to do so. Killeen's $p_{\mathrm{rep}}$ statistic successfully addresses the problem of misinterpretation, and this is a major contribution (cf. Cumming, 2005; Doros & Geier, 2005; Killeen, 2005b; Macdonald, 2005). However, the $p_{\mathrm{rep}}$ statistic does not remedy the second, more fundamental NHST problem mentioned by Killeen. Here we perform the standard analysis to show that $p_{\mathrm{rep}}$ can provide misleading evidence against the null hypothesis (cf. Berger & Sellke, 1987; Edwards, Lindman, & Savage, 1963). This analysis demonstrates the discrepancy between Bayesian hypothesis testing and $p_{\mathrm{rep}}$, and highlights the necessity of considering the plausibility of both the null hypothesis and the alternative hypothesis.
Consider an experiment in taste perception in which a participant has to determine which of two beverage samples contains sugar. After $n$ trials, with $s$ successes (i.e., correct decisions) and $f$ failures, we wish to choose between two hypotheses: $H_0$ (i.e., random guessing) and $H_1$ (i.e., gustatory discriminability). For inference, we use the binomial model, in which the likelihood $L(\theta)$ is proportional to $\theta^{s}(1-\theta)^{f}$, where $\theta$ denotes the probability of a correct decision on any one trial.
A Bayesian hypothesis test (Jeffreys, 1961) proceeds by contrasting two quantities: the probability of the observed data $D$ given $H_0$ (i.e., $\theta = 1/2$) and the probability of the observed data $D$ given $H_1$ (i.e., $\theta \neq 1/2$). The ratio $B_{01} = p(D \mid H_0)/p(D \mid H_1)$ is the Bayes factor, and it quantifies the evidence that the data provide for $H_0$ vis-à-vis $H_1$. Assuming equal prior plausibility for the models, the posterior probability for $H_0$ is given by $B_{01}/(1 + B_{01})$. In the taste perception experiment, $p(D \mid H_0) = (1/2)^{n}$. The quantity $p(D \mid H_1)$ is more difficult to calculate, because it depends on our prior beliefs about $\theta$. Specifically, when prior knowledge of $\theta$ is given by a prior distribution $p(\theta)$, one obtains $p(D \mid H_1)$ by integrating $L(\theta)$ over all possible values of $\theta$, weighted by the prior distribution $p(\theta)$: $p(D \mid H_1) = \int_0^1 L(\theta)\,p(\theta)\,d\theta$. We consider two classes of priors.
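The Bayes factor described above can be sketched numerically. The following is a minimal illustration, not the authors' own analysis: it assumes a Beta(a, b) prior on the probability of a correct decision (with the uniform prior a = b = 1 as the default), which may or may not belong to the two classes of priors the abstract refers to. Under that assumption the integral over the prior has a closed form, a ratio of Beta functions. The function names `log_beta`, `bayes_factor_01`, and `posterior_h0` are hypothetical, and the likelihood omits the binomial coefficient, which cancels in the ratio.

```python
from math import exp, lgamma, log


def log_beta(a: float, b: float) -> float:
    """Natural log of the Beta function B(a, b), via log-gamma for stability."""
    return lgamma(a) + lgamma(b) - lgamma(a + b)


def bayes_factor_01(s: int, f: int, a: float = 1.0, b: float = 1.0) -> float:
    """Bayes factor B01 = p(D | H0) / p(D | H1) for s successes and f failures.

    H0 fixes the success probability at 1/2, so p(D | H0) = (1/2)^(s + f).
    H1 is ASSUMED here to place a Beta(a, b) prior on the success probability,
    which gives p(D | H1) = B(s + a, f + b) / B(a, b).
    """
    log_p_h0 = (s + f) * log(0.5)
    log_p_h1 = log_beta(s + a, f + b) - log_beta(a, b)
    return exp(log_p_h0 - log_p_h1)


def posterior_h0(b01: float) -> float:
    """Posterior probability of H0, assuming equal prior plausibility of H0 and H1."""
    return b01 / (1.0 + b01)


if __name__ == "__main__":
    # Hypothetical data: 7 correct decisions and 3 errors in 10 trials.
    b01 = bayes_factor_01(s=7, f=3)
    print(f"B01 = {b01:.4f}, P(H0 | D) = {posterior_h0(b01):.4f}")
```

Working in log space keeps the computation stable for large n, where (1/2)^n and the Beta functions would otherwise underflow. Changing `a` and `b` shows how strongly the evidence for H0 depends on the prior chosen for H1.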
Similar resources
Bayesian Fuzzy Hypothesis Testing with Imprecise Prior Distribution
This paper considers the testing of fuzzy hypotheses on the basis of a Bayesian approach. To this end, using a notion of prior distribution with interval or fuzzy-valued parameters, we extend a concept of posterior probability of a fuzzy hypothesis. Some of its properties are also investigated. The feasibility and effectiveness of the proposed methods are also cla...
A Bayesian Decision-Theoretic Approach to Logically-Consistent Hypothesis Testing
This work addresses an important issue regarding the performance of simultaneous test procedures: the construction of multiple tests that at the same time are optimal from a statistical perspective and that also yield logically-consistent results that are easy to communicate to practitioners of statistical methods. For instance, if hypothesis A implies hypothesis B, is it possible to create opt...
Computational methods for Bayesian model choice
In this note, we briefly survey some recent approaches to the approximation of the Bayes factor used in Bayesian hypothesis testing and in Bayesian model choice. In particular, we reassess importance sampling, harmonic mean sampling, and nested sampling from a unified perspective.
A Mathematical Perspective on Gambling
This paper presents some basic topics in probability and statistics, including sample spaces, probabilistic events, expectations, the binomial and normal distributions, the Central Limit Theorem, Bayesian analysis, and statistical hypothesis testing. These topics are applied to gambling games involving dice, cards, and coins.
Objective Bayesian Two Sample Hypothesis Testing for Online Controlled Experiments
As A/B testing gains wider adoption in the industry, more people begin to realize the limitations of the traditional frequentist null hypothesis statistical testing (NHST). The large number of search results for the query “Bayesian A/B testing” shows just how much the interest in the Bayesian perspective is growing. In recent years there are also voices arguing that Bayesian A/B testing should ...
Bayesian Sample size Determination for Longitudinal Studies with Continuous Response using Marginal Models
Introduction: Longitudinal study designs are common in many areas of scientific research, especially in the medical, social, and economic sciences. The reason is that longitudinal studies allow researchers to measure changes in each individual over time and often have higher statistical power than cross-sectional studies. Choosing an appropriate sample size is a crucial step in a successful study. A st...
Journal title:
Volume, issue
Pages -
Publication date: 2006